Picture for Yelong Shen

Yelong Shen

Latent Recurrent Transformer: Architecture Exploration, Training Strategies, and Scaling Behavior

Add code
May 26, 2026
Viaarxiv icon

Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection

Add code
May 22, 2026
Viaarxiv icon

Orchard: An Open-Source Agentic Modeling Framework

Add code
May 14, 2026
Viaarxiv icon

Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation

Add code
Apr 15, 2026
Viaarxiv icon

Rethinking Language Model Scaling under Transferable Hypersphere Optimization

Add code
Mar 30, 2026
Viaarxiv icon

Reinforcement World Model Learning for LLM-based Agents

Add code
Feb 05, 2026
Viaarxiv icon

Test-time Recursive Thinking: Self-Improvement without External Feedback

Add code
Feb 03, 2026
Viaarxiv icon

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Add code
Feb 02, 2026
Viaarxiv icon

RLBR: Reinforcement Learning with Biasing Rewards for Contextual Speech Large Language Models

Add code
Jan 19, 2026
Viaarxiv icon

Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization

Add code
Sep 30, 2025
Viaarxiv icon